Integrating Syntactic and Semantic Tools in Sfy

نویسنده

  • Paul M. Heider
چکیده

Despite the large lexicon of even broad coverage Natural Language Processing systems, there are many missing lexical items. We describe two methodologically distinct approaches to augment the lexicon: the look-up approach and the shotgun approach. Both approaches are framed within Sfy, a new research program to pull together several broad coverage systems. The classic look-up approach requires compatible electronic sources (e.g. WordNet). Through system hooks into the source, we can pull as much relational and semantic information as Sfy requires. Since WordNet does not overtly provide Sfy with sufficient syntactic role and semantic information, we use the synonyms of an unknown target word that WordNet provides us with to create templates for a new entry in our lexicon. In a fully realistic model, we cannot always rely on the lookup approach to solve our lexical issues. We need another back-off method. Much like a person intuiting the part-ofspeech of a new term, an unknown word presents a syntactic hole to our parser. Only certain parts of speech will fill that hole. We can try to validate all the possible fillers by naively testing every part of speech. The subset of all these sentences that can be parsed informs us of exactly which parts of speech our unknown word can be. Interestingly, the shotgun approach also provides a means to solve the related problem of a familiar orthogrpahic word functioning in an unfamiliar part of speech.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Verbs in Applied Linguistics Research Article Introductions: Semantic and syntactic analysis

This study aims to investigate the semantic and syntactic features of verbs used in the introduction section of Applied Linguistics research articles published in Iranian and international journals. A corpus of 20 research article introductions (10 from each journal) was used. The corpus was analysed for the syntactic features (tense, aspect and voice) and semantic meaning of verbs. The finding...

متن کامل

Verbs in Applied Linguistics Research Article Introductions: Semantic and syntactic analysis

This study aims to investigate the semantic and syntactic features of verbs used in the introduction section of Applied Linguistics research articles published in Iranian and international journals. A corpus of 20 research article introductions (10 from each journal) was used. The corpus was analysed for the syntactic features (tense, aspect and voice) and semantic meaning of verbs. The finding...

متن کامل

برچسب‌زنی نقش معنایی جملات فارسی با رویکرد یادگیری مبتنی بر حافظه

Abstract Extracting semantic roles is one of the major steps in representing text meaning. It refers to finding the semantic relations between a predicate and syntactic constituents in a sentence. In this paper we present a semantic role labeling system for Persian, using memory-based learning model and standard features. Our proposed system implements a two-phase architecture to first identify...

متن کامل

برچسب‌زنی خودکار نقش‌های معنایی در جملات فارسی به کمک درخت‌های وابستگی

Automatic identification of words with semantic roles (such as Agent, Patient, Source, etc.) in sentences and attaching correct semantic roles to them, may lead to improvement in many natural language processing tasks including information extraction, question answering, text summarization and machine translation. Semantic role labeling systems usually take advantage of syntactic parsing and th...

متن کامل

Manipulation in advertising text: lexical and semantic aspect

The present paper focuses on the questions of modern advertising science, structure of advertising and elements making actual manipulative influence from the addresser. Advertising encourages product sales, is an instrument of forming ethical standards, values, creating cultural values, standards and mode of behavior that is why the wide system of means for achieving aims of advertisers is need...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008